Efficient Treatment of Failures in RPC Systems

Authors

  • Valérie Issarny
  • Gilles Muller
  • Isabelle Puaut
Abstract

This paper addresses extensions to a basic remote procedure call (RPC) system that integrate primitive fault tolerance measures, so that the system tolerates failures of nodes and communication links. Our main design goals are to introduce no performance penalty for remote procedure calls that execute in the absence of failures, and to impose no significant overhead when failures are treated. The extensions consist of a simple algorithm that finds and eliminates orphans, and a mechanism that detects abnormally terminated remote calls. Our solution for orphan detection is based on the extermination approach; its efficiency comes from a minor addition to the system architecture that allows the implementation of high-speed stable storage. Performance measurements of our reliability mechanisms, implemented on top of the Mach 3.0/BSD UX36 operating system, show that in the absence of failures they add only 1% overhead to the operating system's base remote procedure call.
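
The abstract names the extermination approach but does not spell out the algorithm. As a rough illustration only, the general idea can be sketched as follows: a client records each outstanding call in stable storage before issuing it, and after a crash it reads that record and asks the servers to kill whatever was left running. This is not the authors' implementation. Every name here (`StableLog`, `Server`, `Client`, `recover`) is hypothetical, and an in-memory object stands in for the high-speed stable storage the paper relies on.

```python
from dataclasses import dataclass, field


@dataclass
class StableLog:
    """Hypothetical stand-in for stable storage: it outlives a client crash."""
    next_seq: int = 0
    pending: set = field(default_factory=set)  # {(server_name, call_id)}


class Server:
    """Toy server that only tracks which remote calls are still executing."""

    def __init__(self, name):
        self.name = name
        self.running = {}  # call_id -> client name

    def start_call(self, call_id, client_name):
        self.running[call_id] = client_name

    def finish_call(self, call_id):
        self.running.pop(call_id, None)

    def exterminate(self, call_id):
        # Kill the computation if it is still running; report whether it was.
        return self.running.pop(call_id, None) is not None


class Client:
    def __init__(self, name, log, servers):
        self.name = name
        self.log = log
        self.servers = servers

    def begin_call(self, server_name):
        seq = self.log.next_seq
        self.log.next_seq += 1
        call_id = f"{self.name}-{seq}"
        # Record the call in stable storage BEFORE contacting the server,
        # so that a crash at any later point leaves a trace of it.
        self.log.pending.add((server_name, call_id))
        self.servers[server_name].start_call(call_id, self.name)
        return call_id

    def end_call(self, server_name, call_id):
        self.servers[server_name].finish_call(call_id)
        self.log.pending.discard((server_name, call_id))


def recover(log, servers):
    """Run after a client restart: exterminate every call the log still lists."""
    killed = []
    for server_name, call_id in sorted(log.pending):
        if servers[server_name].exterminate(call_id):
            killed.append((server_name, call_id))
    log.pending.clear()
    return killed


# Usage: two calls are issued, one completes, then the client "crashes".
log = StableLog()
servers = {"s1": Server("s1")}
client = Client("c", log, servers)
done = client.begin_call("s1")
orphan = client.begin_call("s1")
client.end_call("s1", done)
del client  # simulated crash: volatile state is lost, the log survives
killed = recover(log, servers)
```

In this sketch every call pays for one write to the log, which is why the speed of stable storage dominates the failure-free cost of an extermination scheme.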

Similar resources

An Efficient Strategy for Power Rating Reduction of Back-to-Back Converters Used in Railway Power Conditioner

One of the most common solutions to deal with the power quality issues of electrical railway systems is employing Railway Power Conditioner (RPC). In the conventional mode of operation, the RPC power rating deployed in every Traction Power Substation (TPS) is significantly large. This paper mainly focuses on the power rating reduction of RPC through presenting a novel combinatorial compensation...

MTBF evaluation for 2-out-of-3 redundant repairable systems with common cause and cascade failures considering fuzzy rates for failures and repair: a case study of a centrifugal water pumping system

In many cases, redundant systems are beset by both independent and dependent failures. Ignoring dependent variables in MTBF evaluation of redundant systems hastens the occurrence of failure, causing it to take place before the expected time, hence decreasing safety and creating irreversible damages. Common cause failure (CCF) and cascading failure are two varieties of dependent failures, both l...

Communication Primitives for Ubiquitous Systems or RPC Considered Harmful

RPC is widely used to access and modify remote state. Its procedural call semantics are argued as an efficient unifying paradigm for both local and remote access. Our experience with ubiquitous device control systems has shown otherwise. RPC semantics of a synchronous, blocking invocation on a statically typed interface are overly restrictive, inflexible, and fail to provide an efficient unifyi...

RPC Under Fire

In 1976, James E. White of the Stanford Research Institute published details in RFC 707 about what would come to be called the remote procedure call (RPC).1 He described his “procedure call model” as a way of making the networked environment seem completely familiar to application developers, rather than exposing the network directly to them and thus presenting them with a development model so...

Fast, Optimized Sun RPC Using Automatic Program Specialization

Fast remote procedure call (RPC) is a major concern for distributed systems. Many studies aimed at efficient RPC consist of either new implementations of the RPC paradigm or manual optimization of critical sections of the code. This paper presents an experiment that achieves automatic optimization of an existing, commercial RPC implementation, namely the Sun RPC. The optimized Sun RPC is obtain...

RPC Chains: Efficient Client-Server Communication in Geodistributed Systems

We propose the RPC chain, a simple but powerful communication primitive that allows an application to reduce the performance effects of wide-area links on enterprise and data center applications that span multiple sites. This primitive chains together multiple RPC invocations so that the computation can flow from one server to the next without involving the client every time. We demonstrate tha...

Publication date: 1994